A collection of parallel linear equations routines for the Denelcor HEP

نویسندگان

  • Jack J. Dongarra
  • Robert E. Hiromoto
چکیده

This paper describes the implementation and performance results for a few standard linear algebra routines on the Denelcor HEP computer. The algorithms used here are based on high-level modules that facilitate portability and perform efficiently in a xvide range of environments:The modules are chosen to be of a large enough computational granularity so that reasonably optimum performance may be insured. The design of algorithms with such fundamental modules in mind will also facilitate their replacement by others more suited to gain the desired performance on a particular computer architecture. We have been using the Denelcor HEP (Heterogenous Element Processor) to implement a modest set of parallel routines to handle some common problems that arise when dealing with dense matrices in linear algebra: matrix multiplication, Cholesky decomposition of a positive definite matrix, LU factorization with partial pivoting, and QR factorization of a general matrix. Jordan [3] describes the architecture and programming environment of the Denelcor HEP, and Stewart [5] provides a complete description of the algorithms discussed here. Part of the experiment was to examine the ease of taking a collection of algorithms, expressed in terms of high-level modules, and implementing then on a computer with parallel constructions, such as the Denelcor HEP. Our hope was to gain near-optimum performance from these routines by implementing only the underlying modules using parallel constructs. We look on our experience as an experiment in producing portable algorithms that have a high level of granularity in their structure and high performance on a wide variety of computer architectures. The basic algorithms used here are the same as those reported in a paper by Dongarra and Eisenstat [1] (with the exception of QR factorization). These algorithms are based on standard procedures in linear algebra. They have been written to retain much of the original .mathemati-cal formulation and are based on matrix-vector operations. Designing the algorithms in terms of such operations is the hard part of an implementation. By understanding the algorithm in terms

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Parallel Iterative Methods PIM package for the solution of systems of linear equations on parallel computers

We present a collection of public domain Fortran routines for the solution of sys tems of linear equations using a variety of iterative methods The routines implement methods which have been modi ed for their e cient use on parallel architectures with either shared or distributed memory PIM was designed to be portable across di erent machines Results are presented for a variety of parallel comp...

متن کامل

Parallel Adaptive Numerical Schemes for Hyperbolic Systems of Conservation Laws*

We generalize the first author’s adaptive numerical scheme for scalar first order conservation laws to systems of equations. The resulting numerical methods generate highly non-uniform, time-dependent grids, and hence are difficult to execute efficiently on vector computers such as the Cray or Cyber 205. In contrast, we show that these algorithms may be executed in parallel on alternate compute...

متن کامل

A reliable linear algebra library for transputer networks

This paper presents a collection of linear algebra subroutines for transputer networks The developed pilot library is intended to tbrm a basis of a complete parallel linear algebra library fi~r validating computations, whose routines will deliver [as accurately as necessary) eithex the best possible result, or a corresponding inclusion based on controlled rounding and an optimal scalar product ...

متن کامل

SNAP (Small-World Network Analysis and Partitioning) Framework

Discussion Both LAPACK and ScaLAPACK libraries contain routines for solving systems of linear equations, least squares problems, and eigenvalue problems. The goals of both projects are efficiency (to run as fast as possible), scalability (as the problem size and number of processors grow), reliability (including error bounds), portability (across all important parallel machines), flexibility (s...

متن کامل

The HEP Parallel Processor

A lthough there is an abundance of concepts for parallel computing, there is a dearth of experimental data delineating their strengths and weaknesses. Consequently, for the past three years personnel in the Laboratory’s Computing Division have been conducting experiments on a few parallel computing systems. The data thus far are uniformly positive in supporting the idea that parallel processing...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Parallel Computing

دوره 1  شماره 

صفحات  -

تاریخ انتشار 1984